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Abstract. Half-space depth (also called Tukey depth or location depth) is one 
of the most commonly studied data depth measures because it possesses many 
desirable properties for data depth functions. The data depth contours bound 
regions of increasing depth. For the sample case, there are two competing 
definitions of contours: the rank-based contours and the cover-based contours. 

In this paper, we present three dynamic algorithms for maintaining the half- 
space depth of points and contours: The first maintains the half-space depth 
of a single point in a data set in O(logn) time per update (insertion/deletion) 
and overall linear space. By maintaining such a data structure for each data 
point, we present an algorithm for dynamically maintaining the rank-based 
contours in 0(n ■ logn) time per update and overall quadratic space. The 
third dynamic algorithm maintains the cover-based contours in 0(n ■ log 2 n) 
time per update and overall quadratic space. 

We also augment our first algorithm to maintain the local cover-based con- 
tours at data points while maintaining the same complexities. A corollary of 
this discussion is a strong structural result of independent interest describing 
the behavior of dynamic cover-based contours near data points. 

Key words: Data depth, Half-space depth, Dynamic algorithm, Data struc- 
ture 



1. Introduction 

Data depth is a statistical analysis method that assigns a depth value to a point 
c C M. d that measures how deep (or ce ntral) x i s relative to a probab ility dis- 



Il999l ]). Data 



dept h has recently at tracted attention from the computational geometry commu- 



nity |Liu et allliOQ^ . It does not require a priori assumptions on the underlying 



probability distribution of data and handles o bservations that deviate from the data 
(outliers). Half-space depth, first defined by [Tukevl . Il975t iHodgesl . Il95% is a data 



depth measure that has been commonly studied in recent years, e. g., 
Il999t iBremner et ail . 120081 lAloupis et all 120021 iMiller et all . |2003| 



Liu et al 



Dep t h con 



tours are nested contours that enclose regions of increasing depth [Tukevl . fl975j| 
They provide a powerful tool to visuali ze, quantify, and compare data sets (e.g., 
Liu et all ll999HRousseeuw et allll999j ). The statistics community has produced 
two competing definitions for depth contours, yi elding distinctive contours. The 
two main approaches were termed cover and rank [Rafalin and Souvainei . I2004J ] , see 
Figure [T] In this paper, we present dynamic algorithms in R 2 for maintaining the 
half-space depth of a data point in O(logn) time per update, the rank-based con- 
tours in O(rtlogn) time per update, and the cover-based contours in 0(nlog n) 
time per update. 
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The half-space depth (sometimes called the location depth or the Tukey depth) 
of x G R d relative to a data set J- n = {Xi, ■ ■ ■ ,X n } is the minimum fraction of 
points of J- n lying in any closed half-space containing x. In R 2 , the half-space 



depth of a single point can be computed in optima l Q(n log n) time [Aloupis et al 



2002 ; Ruts and Rousseeuw , 1996b ; Bremner et al. . 20081 and th e half-space depth 



of all points can be computed in Q(n ) time Miller ct al., 200 31 . In higher dimen 



sions. iFukuda and RostaL 12003 : Bremner et all l2008t iRousseeuw and Struyi 1998t 



* 1 1 | ^ ■ j ■ I I, , ^ i i , 

Struvf and Rousseeuwl . |2000| present algorithms to compute the half-space depth 



of a single point. The half-space median is a point of maximal half-space depth (if 
more than one point has maximal depth, then the center of mass of all such points 
is commonly used); in R d , the half-space depth of the median is at least ^Xi l n 
by Helly's theorem [Eckhoff, 1993] and at most \n/2~\/n (for data sets without dou- 
ble p oints). In R 2 , a point with m aximal depth can b e computed in 0(n\og n) 
time [Langerman and Steiger . l2003| . and [Chanl . l2004| presents a randomized al- 
gorithm to compute the median in 0(n log n) expected time. Chan's algorithm 
exte nds to higher dimensi o ns with an expected tim e of O ( n d ~ 1 ). For other results , 



see 



Ruts and Rousseeuw . 1996a , Il998j ; iMatousekl ange rman and Ste iger 



2000( | . A center point is a point with depth at least jn. In the plane, 



Jadhav and Mukhopadhvavl 1 1994} present an O(n) algorithm for computing a cen- 



ter point. 



Agarwal et al 



For othe r results, 
2008j 



see 



Cole et all 11987: Matousek, 119911 iColel 11987 



The sample cover contour of depth d/n [Tukev , 1975j is the boundary of all points 
(not necessarily from the data set) with depth at least d/n. The idea behind the 
cover-based contours is based on the assumption that the data set is generated from 
an underlying distribution and that the contours should represent the behavior of 
the underlying distribution. The sample rank contour of the ath centr al region is 



the c onvex hull containing the a most central fraction of data points [Liu et al 
1999]. The rank-based contours focus more on the data points themselves because 



data points may represent actual observations, see Figure [TJ Therefore, only data 
points can be vertices of rank-based contours, and they may appear on multiple 
contours. On the other hand, non-data points from R d can be vertices of cover- 
based contours and each data point will occur on a single contour (for data sets in 
general position). For sufficiently nice underlying distributions, both types of con- 
tours will, al most surely, approach th e half space depth contours for the underlying 
distribution Zuo and Serflingl . l2000j ] . Although the rank-based contours lack some 
of the structure of the cover-based contours, they often provide a reasonable and 
less computationally expensive contour. In R 2 , any half-space depth cover-based 
contour edge lies on a segment between two data points. At most ("J such segments 
exist, thus the complexity of the collection of half-space cover contours is 0(n 2 ). 
This bound is tight, and it is achieved, for example, when all points of the data set 
lie on the convex hull of the data set. Rank-based depth contours, by definition, 
enclose the 1st, ^^-th, . . . , —th sample regions, have a total complexity of 0(n 2 ) 
(once data points of the same depth have been arbitrarily ordered), but only in- 
volve O(n) distinct edges since many contours share edges. (The bound on the 
number of edges can be proved by amortizing the complexity over the data points 
since the difference between succ essive contou r s is in duced by removing a point and 
its two incident edges.) In R 2 , Miller et all 120031 ] present an optimal algorithm 
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that computes all of the cover-based contours in 0(n 2 ) time; at the same time, it 
computes the depth of every data point, and, therefore, can also be used to com- 
pute rank-based cont ours. Several implementation s exist for computing cover-based 
contours: isodepth IRuts and Rousseeuwl. Il99 6bl can compute the fc th contour in 
0(n 2 logn) time. FDC Johnson et al. . 1998} computes the outermost k contours 
and outperform s isod epth for small k. Using hardware assisted computation, 



Krishnan et all 12002a compute the half-space depth cover-based contours for dis- 



play. Chanl . l2004j | presents an algorithm to compute the cover- based depth contour 
of de pth k in 0(n log 2 n) expected time. In higher dimensions, [Fukuda and Rosta . 



2003| present an algorithm for the computation of these contours. 



(a) (b) 

Figure 1. Half-space depth: (a) All cover-based contours for 50 
points from a bivariate normal distribution with mean (0,0) and 
covariance matrix 47; (b) The 10%, 20% . . . 100% rank-based con- 
tours for the same data set. Both sets of contours have similar 
shapes, but are composed of distinct polygons: for example, the 
cover-based contours do not share vertices while the rank-based 
contours do share vertices. 



Few dynamic algorithms (allowing the insertion or deletion of data points) for 
data depth computati ons exist, despite th e interest of the statistical community 



Liu], l2003j . Recently, [Abbott et all 120091 ] presented a dynamic algorithm which 



can be used to maintain a point of half-space depth at least 1/4 and also a dy- 
namic algorithm which dea ls with the case where the convex hull peeling depth 
Barnettl. Il976t lEddvl. Il982l of a data set is bounded Another result, presented 



Agarwal and Matousekl |1995| , dynamically maintains a data structure for half- 
space queries; this solves a different, but related, problem than to maintain the 
half-space depth of a point. For cover contours, our algo rithm is essentially main- 
tainin g t he fc-belts of an arrangement, a s introduced in Edelsbrunner and Welz! 
1986j . In Edelsbrunner and Welzll . Il986l | , the authors construct the levels of an ar- 



rangement using a sweepline and claim, in passing, that the algorithm can be made 
dynamic. While their algorithm is related to ours since we are solving the same 
problems and use similar techniques and tricks, it is difficult to compare the algo- 
rithms: the authors neither provide an explicit version of their dynamic algorithm 
nor any claims as to its complexity. 
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This paper presents the first dynamic algorithm for computing the half-space 
depth contours in R 2 of a data set of n points in general position. In Section 2, 
we review some properties of half-space depth and half-space depth contours. In 
Section 3, we present an algorithm for maintaining the half-space depth of a single 
point in O(logn) time per operation (insertion or deletion) and l inear overall space . 



The data structure that we use was originally presented in [van Kreveld et al, 
1999], but we augment it to enable dynamic updates. In Section 4, we expand the 
algorithm and augment the data structure in Section 3 not only to maintain the 
depth of a point, but also to maintain the cover-based contours near points that 
do not lie on degenerate contours, i.e., contours consisting of either a single point 
or two points and the segment between them. Through this analysis, we present a 
theorem of independent interest that shows that the cover-based contours cannot 
change too much after an insertion or deletion. In Section 5, we use the algorithm 
from Section 3 (or from Section 4, since that algorithm is an augmented version 
of the algorithm from Section 3) to develop an algorithm to maintain the rank- 
based half-space depth contours in 0(n log n) time per update and overall quadratic 
space. In Section 6, we present an algorithm and data structure to maintain the 
cover-based half-space depth contours in 0(n log 2 n) time per operation and overall 
quadratic space. Both of these algorithms represent improvements over the static 
running time of 0(n 2 ). Finally, we conclude in Section 7. 

2. Half-Space Data Depth and Contours 

In this section, we review the basic definitions and properties of half-space depth 
and its contours. In this direction, we provide alternate combinatorial descriptions 
of these definitions which will be useful throughout this paper. For example, the 
global description of the contours will be used in Section |6l and the discussion of 
the description of the local half-space depth contours near a data point will be 
completed in Section S) All of the half-spaces considered in this paper are closed. 

Definition 2.1 ( [Tukevl . 1 19751 ]). The half-space depth of a point p with respect to 
a data set J- n — {Xi, • • • , X n } is the smallest fraction of data points contained in a 
half-space containing p, i.e., min{(#£f n F n )/n\p £ H a half-space}. It is sufficient 
to consider only half-spaces that include p in their boundary, i.e., min{(#_ff n 
F n )/n\p £ dH, H a half-space}. 

There is some tension between how half-space depth is computed in the statistics 
and probability community and in the computational geometry community. The 
statistics and pro bability viewpoi nt defines half-space depth in terms of probabilities 



over half-spaces Liu et al. , 19991 ]. The computational geometry viewpoint prefers 



to count the number of data points in a half-space. In our definition, we choose to 
represent the half-space depth as the fraction, instead of the number, of points of J- n 
in the half-space H because this definition more accurately reflects the relationship 
between the finite sample ca se and the continuous case, i.e., the definition for 
distributions Liu et al. , ll999j . Even though we use this definition in our theoretical 
proofs, in our algorithms, the standard computational geometry definition becomes 
more desirable to maintain our time complexities. 

Half-space depth has many nice properties. The half-space median is robust in 



the presence of outliers because Helly's theorem implies that at least 



of the 



d+l 

data points in J- n must be outliers to alter the position of the median significantly. 
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Half-space depth has all of the desirable properties for a data depth function: afhnc 
invariance, maximality at the center, monotonicity relative to t he deepest point 



vanishing at infinity, a nd invariance under dimensions change [Zuo and Serfling , 
l200d: iBurr etalll2006j . 

2.1. Half-Space Depth Contours. There are two main definitions for data depth 



contours called the cover-based contours and the rank-based contours Rafalin and Souvaine 



2004], and, for most data sets, they provide distinct contours. 



Definition 2.2 (Cover-based contours [Tukevl Il975j ). The cover-based contour 



of depth t is the boundary of {x £ R d | Half-space depth of x is at least t}. This 
approach constructs a contour around all points of K d (not necessarily from the 
data set) with half-space depth at least t. 

A simple, albeit inefficient, way to compute the cover-based contour of depth 
t in R d (which will be justified below) is as follows: first, divide the plane into 
cells determined by hyper-planes through d data points. The half-space depth is 
constant in each of these cells. Second, pick a point in each cell (which cannot be 
a data point), and compute its half-space depth. Finally, take the closure of the 
union of all cells with half-space depth at least t; the boundary of this region is the 
cover-based contour of depth t. 



Definition 2.3 (Rank-based contours Liu et al. . 1999} ). The rank-based contour 



of the ath central region is the convex hull of the a most central fraction of data 
points where ties are broken arbitrarily. This approach constructs a contour around 
the deepest data points. 

A simple way to compute the rank-based contour of the ath central region is 
to first compute the depth of every data point and sort the data points by depth 
(in this sorted list, ties are broken arbitrarily). Then, take the n ■ a data points 
with largest half-space depth (with respect to the arbitrary ordering) and compute 
the convex hull of this set. This convex hull is the rank-based contour of the ath 
central region. 

Both contour definitions can be adapted to any data depth function by the 
appropriate change in the definition. In this generality, both contour definitions 
have their merits, for example, the rank-based contours are always convex and 



nested while the cover-based contours have more structure, see iBurret al., 200 1 



For half-space depth, in particular, the cover-based contours can be described in 
terms of a certain finite collection of half-spaces determined by the data points. On 
the other hand, the rank-based contours have no such description. In the remainder 
of this section, we recall this description. We begin with the following assumption: 
for the remainder of the paper, we assume that all data sets are in general position. 

The definition of half-space depth is a minimization problem over all half-spaces 
that contain a given point p. As is usual in this setup, there is a dual problem that 
describes all points of a given half-space depth, as in the following proposition: 

Proposition 2.1. The cover-based contour of depth k/n in M. d is the boundary of 
the intersection of all closed half-spaces containing exactly n — k + 1 data points. 



This result can, in fact, be improved to only consider a finite number of half- 
spaces. 
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Corollary 2.1. The cover-based contour of depth k/n in R d is the boundary of the 
intersection of all closed half-spaces containing exactly n — k + 1 data points and 
whose boundary contains exactly d data points. 

These results are not difficult, but they both require some technical, linear 
algebra- type lemmas. For example , one way of pro ving these equivalences is to 
use translations and rotations, see Cole et all . 1987 for a sketch. To translate a 



given half-space means to find a second half-space whose bounding hyperplane is 
parallel to the bounding hyperplane of the original half-space and both half-spaces 
open in the same direction. To rotate a given half-space around a subflat of the 
bounding hyperplane means to find a second half-space where the bounding hyper- 
plane includes the given subflat (alternately, this rotation can be interpreted as a 
point in the appropriate Grassmannian) . Note that Corollary 12 . II does not say that 
the d data points lie on the contour of depth k/n, only that the faces of the contour 
of depth k/n lie in hyper-planes that each contain d points, but these points may 
be far from the contour itself, e.g., see Figures QJa) and [21(a). 

In addition, Corollary 12.11 provides an explicit way to compute the half-space 
depth of a data point by investigating a finite number of half-spaces. In particular, 
we will use the following corollary (in 2-dimensions) to motivate our data structures: 

Corollary 2.2. If p is a data point of J- n in Mr, then to compute the half-space 
depth of p it is sufficient to consider half-planes whose boundaries include both p 
and d — 1 other data points of T n , i.e., min-f^-ff n T n — (d — l))/n\#J-' n n dH = 
d and H a half-space} . 

Note that the —{d— l)/n appears in the formula because a small rotation can 
remove the (d — 1) data points, other than p, from dH. Since every data point 
appears on some contour, Corollary 12.21 implies the following corollary which is an 
initial description for the local cover-based contours near a data point: 

Corollary 2.3. If p is a data point of half- space depth k/n in M. d , then the edges 
of the cover-based contour of depth k/n that are incident to p are defined by planes 
through p and d — 1 other data points. 

2.2. Terminology. Our discussion above has highlighted the data which we need 
in order to maintain the rank-based or cover-based contours. In this section, wc 
define the terminology for the half-spaces and their associated geometric objects 
which will be used through the remainder of this paper. We begin with the following 
assumption: for the remainder of this paper, we restrict our attention to R 2 . 

For most cover-based contours, the contour bounds a region, but, for the deepest 
contour, it is possible that the contour does not bound a region, but consists of either 
a single point or a segment. Our algorithm must treat these degenerate contours 
as a special case. 

Definition 2.4. A cover-based contour is degenerate if it consists of either a single 
point or of two points connected by a segment, see Figure [2ja). 



It is easy to show using Corollary 12 . 1 1 that the only time that a degenerate cover- 
based contour consists of a segment is when the data set has exactly two points. 
In non-degenerate cases, the half-space depth contours bound a region and we now 
define the features of these regions locally near a data point: 
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Figure 2. (a) Degenerate half-space depth contours: A set of 
6 points. The outermost contour is the convex hull of the data 
set; the next contour contains 5 vertices, none a point of the data 
set. The inner-most contour contains one data point, (b) The 
transformation of the half-planes determined by q and p. The bold 
semicircle represents all half-planes whose number of points will be 
incremented (decremented) by one when q is inserted into (deleted 
from) the data set because q is to the right of the associated vector. 

Definition 2.5. The defining edges (lines) of data point p of half-space depth 
k/n with respect to the data set J-" n , whose cover-based contour of depth k/n is 
non-degenerate, are the two edges (lines defined by the edges) incident to p on 
the cover-based contour of depth k/n. We will use the notation l\ and I2 for the 
defining lines. 

Each of the defining lines for a point p determines two half-planes, one of which 
contains n — k + 1 data points while the other contains k + 1 data points. The half- 
plane containing k + 1 data points is important because it describes the contour of 
depth k/n locally and provides a witness for the depth of p, see Figure |U 

Definition 2.6. The defining half-planes Hi t ,Hi 2 for data point p of depth k/n 
with respect to !F ni whose cover-based of depth k/n contour is non-degenerate, are 
the closed half-planes bounded by the defining lines of point p with respect to T n 
and which contain k + 1 data points. 

Corollary [271] implies that the defining lines for a data point p must pass through 
an additional data point. For our dynamic algorithms, it makes sense to keep track 
of all possible half-planes which could be new defining half-planes after an insertion 
or deletion, i.e., all half-planes whose boundary includes p and an additional data 
point. 

Definition 2.7. The meaningful half-planes are the half-planes whose bounding 
line passes through two data points. Let p be a data point, then the meaningful 
half-planes with respect to p are all of the meaningful half-planes whose boundary 
includes p. Note that there are 2(n — 1) meaningful half-planes with respect to p 
since every line determines two half-planes. 

Since is is not enough to use lines to keep track of meaningful half-planes, we 
choose to represent half-planes as (unit) vectors or, equivalently, the points on the 
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unit circle centered at p where the unit vectors terminate. The half-plane associated 
with a vector v centered at p is the half-plane consisting of all data points to the 
right of the vector, i.e., all points q such that the clockwise angle between v and the 
vector v q , the vector from p to g, is less than or equal to 7T, see Figures [2jb) and [3] 



r 3to r 2to 




Figure 3. The transformation of the meaningful half-planes with 
respect to p: (a) Point q is being inserted into (or deleted from) 
the data set. (b) The vectors represent the two half-planes defined 
by each data point and p. The half-plane associated with a vector 
consists of all points to its right so that the shaded region represents 
the set of half-planes which contain q. In particular, the shaded 
region represents the set of half-planes whose number of included 
points increases (decreases) when q is inserted into (deleted from) 
the data set. On the other hand, q, rj., and r$ are in the half-plane 
associated with the vector pointing towards q. 



3. Dynamic Maintenance the Depth of Data Points 

In this section, we present an algorithm to maintain dynamically the half-space 
depth of a data point in 0(log n) time per insertion or deletion and linear space 
overall. This is an important step in the maintenance of the rank-based contours 
because these contours are based on the depths of the data points. One of the 
simplest 0(nlogn) algorithms for computing the ha lf-space depth of a p oint p, not 



necessarily a data point, is the following algorithm [Aloupis et all 12002] : 

First, sort all of the data points radially around p. Next, use two pointers to 
step through this list. The two pointers are separated by an angle of tt to represent 
a half-plane passing through p. Finally, keep track of the smallest fraction of data 
points between the two pointers; this value, minus 1/n, is the half-space depth of 
p (the — 1/n appears as in Corollary [ 



In this section, we turn this static algorithm into a dynamic algori thm. The static 



versio n of the data structure that we use was originally presented by [van Kreveld et al 
1999] for a different problem. 

3.1. The Angular Transformation. The first step in the static algorithm above 
is to sort the data points radially around data point p. In the dynamic algorithm, 
we instead need to maintain the sorted list of data points. In the algorithm above, 
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we needed two pointers to find the depth of p; therefore, our data structure will 
maintain all of the meaningful half-planes with respect to p. We represent the half- 
planes as unit vectors originating at p, as described in Section [521 Since every data 
point, other than p, defines two meaningful half-planes, we label the vector pointing 
towards the point "to" and the vector pointing away from the point "away," see 
Figures Mb) and El 

The advantage of this representation is that the updates to the count of points 
in meaningful half-planes after an insertion or deletion are relatively simple to 
describe. In fact, when a point q is inserted into (deleted from) the data set, 
the only half-planes that need to be adjusted are those in the semicircle counter- 
clockwise from q to to qway, see Figure [2Kb). In the case of a circle, this semicircle 
is an arc and can be used to update the number of points in the half-planes in 
sub-linear time. 

3.2. Data Structure for Updating the Half-Space Depth of a Data Point. 

In order to recompute the depth of point p after an insertion or deletion, first, 
the number of data points in every meaningful half-plane with respect to p has to 
be recomputed. Then, the minimum number of data points in these meaningful 
half-planes must be computed. If this number is k + 1, then the depth of p will be 
k/(n ± 1) where the ± corresponds to the insertion or deletion of a data point. To 
recompute the depth of p efficiently, all meaningful half-planes with respect to p are 
updated simultaneously by increasing or decreasing the number of points associated 
with all affected half-planes by one. 

We require a data structure that maintains all of the meaningful half-planes for 
a given point p and which allows insertions and deletions in O(logn) time. The 



static version of this data structure was originally presented in [van Kreveld et al. . 
.1999] for a different problem. We begin by describing the operations which are 
necessary for a data structure to solve this problem and then discuss our particu- 
lar implementation (in Section 4, we will augment this data structure in order to 
maintain more of the local structure of the depth contours). 

Let T be the data structure representing the set of meaningful half-planes with 
respect to p. Each meaningful half-plane is labeled by a non-negative integer Depth, 
which is the number of data points in the half-plane. 

• Increment (T, left, right), Decrement (T, left, right) - Given a counter- 
clockwise circle segment defined by its two end-directions (left and right), 
increment or decrement the Depth of all vectors within the circle segment. 

• Insert (T , q' , k' , to) - Given a new vector g', the number of data points 
k' in the half-plane corresponding to q' ', and a boolean value to recording 
if q' points towards a data point, update the data structure to include q' . 

• Removed, q') - Given a pointer to an existing vector q' , delete q' from the 
data structure. 

• MinDepth(T) - Return the minimum number of points in a meaningful 
half-plane associated with p. 

3.2.1. Detailed Operations. In our implementation, the data structure is an aug- 
mented dynamic balanced binary search tree representing the meaningful half- 
planes associated with p. The leaves of this tree correspond to the meaningful 
half-planes and therefore the height of the tree will be O(logn). Each meaningful 
half-plane will be represented in a leaf by its corresponding point on the unit circle 



10 MICHAEL A. BURR, EYNAT RAFALIN, AND DIANE L. SOUVAINE 

in the range [0, 2n), i.e., the counter-clockwise angle from the positive horizontal 
vector to the vector associated with the half-plane. The leaves of the tree are sorted 
by their direction in the interval [0, 2n). 

The leaves of the tree are augmented with three fields: the direction, Depth, 
and to fields. The direction is the value in [0, 2ir) corresponding to the vector 
associated with the meaningful half-plane. The Depth is the number of data points 
in the represented meaningful half-plane. The to is a boolean value that is true if 
the vector corresponding to the meaningful half-plane points towards a data point 
and false if it points away. The internal nodes of the tree are augmented with three 
fields: the minDepth, maxDepth, and subtree-addition fields. The minDepth and 
maxDepth fields of a node contain the minimum and maximum Depth values of all 
leaves in its subtree. The subtree-addition value is a number by which all of the 
leaves of the node should be incremented or decremented. The actual updates are 
deferred until a traversal of this tree is necessary. 

To perform an Increment (T, left, right) or Decrement (T, left, right) 
the Depth values for leaves or the subtree-addition values for internal nodes 
are incremented or decremented to represent the change in all leaves in a counter- 
clockwise direction from left to right. This can be performed by traversing the 
paths from the leftmost leaf to the rightmost leaf of the interval from left to 
right. While traversing these paths, the appropriate Depth or subtree-addition 
values of the children of this path between left and right are incremented or 
decremented. In addition, the minDepth and maxDepth fields of all ancestors of q' 
may be adjusted as well, if necessary. 

AnInsert(T, q' , k' , to) or Removed, q' ) can be performed with a standard 
dynamic binary tree insert or delete along with an adjustment of the minDepth and 
maxDepth fields of the ancestors of q', if necessary. This adjustment can be per- 
formed by traversing a path from the root of the tree to q' . The MinDepth (T) can be 
performed by examining the root of the tree: the minDepth value of the root main- 
tains the half-space depth of p, i.e., the half-space depth of p is (minDepth(root) — 
l)/n. 

Using these operations, an insertion or deletion of a data point q is straight- 
forward. Let T be the tree and q to be the vector from p pointing towards q and 
assume that the angle it makes counter-clockwise from the positive horizontal vec- 
tor is 7. In addition, let q away be the vector from p pointing away from q. The 
half-planes that contain q are those with direction in the interval [7,7 + ir] (or 
[7, 2tt) U [0, 7 — 7t] if 7 > 7r). To insert q, first perform Increment (1,7,7+ 7r ) ( or 
the two increments on the intervals described above) and then perform Insert (T, 
q to , k, true) and Insert (T, q away , n — k + 2, false) , where k is the number 
of points in the half-plane corresponding to q to and can be computed easily by con- 
sidering the Depth and to fields of a neighbor of q to in the tree. To delete q, first 
perform Decrement (1,7,7+ 7r) ( or the two decrements on the intervals described 
above) and then perform Remove (T, q to ) and Removed, q aW ay) ■ 

The tree, which stores the 2(n — 1) vectors corresponding to meaningful half- 
planes with respect to p, has linear structure and all operations require logarithmic 
time. Since only a single region is incremented or decremented, the dynamic tree 
on the set of vectors allows associated values to be updated in logarithmic time. 
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Theorem 3.1. An algorithm exists that can dynamically maintain the half-space 
depth of a data point relative to a dynamic set of n points in 0(log n) time per 
operation and linear space. 

Proof. To maintain the half-space depth of a single point p the minDepth field of the 
root needs to be considered after every update. The depth of p is (minDepth(rooi) — 
l)/n. □ 

4. Dynamically Updating Local Cover Contours 

In this section, we augment the data structure from Section 3 to maintain the 
defining half-planes for the point p. These half-planes provide a witness to the 
depth of p. In addition, knowing the defining half-planes for p provides a constant 
time answer for the depth change query: if q is inserted into or removed from 
the data set, will the half-space depth of p change? We begin by discussing six 
properties of the half-space depth cover contours and meaningful half-planes with 
respect to the data point p. After this, we analyze various cases of insertion or 
deletion with respect to the point p. These cases prove Theorem 14. 1[ a theorem of 
independent interest, that shows that defining half-planes cannot change too much 
after an insertion or deletion. Since this section deals with the defining half-planes, 
which are not defined for degenerate contours, all of our data points must lie on a 
non-degenerate contour both before and after an update. At the end of this section, 
we describe a test which can determine if a point is on a degenerate contour: it 
amounts to failing the counting property, Property 02 below. 

4.1. Properties of the Dynamic Contours. The transformation in Section 3 
allows us to present 6 important structural properties for the meaningful half- 
planes. These properties are the keys to our algorithm for updating the defining 
half-planes of individual points. Because these properties are technical, we will 
fix some notation in this section. The angle of a vector used to represent a half- 
plane is measured counterclockwise from the positive x-axis. The notation Ei ± will 
represent the cover-based contour edge that lies on the line Zi. A half-plane will be 
written Hi where I is the boundary line of this half-plane; although this notation 
is not unique, the half-plane chosen will be clear from context. A vector will be 
written as vh, , where Hi is the half-plane represented by the vector, or as qto and 
Qaway when the vectors are associated with the data point q, see Figures IHa) and 
[3] For these properties, w.l.o.g., assume that p is the origin. 

Property 1. Convexity property: This property simplifies the following ar- 
guments and notation by fixing a standard orientation for a data point and its 
associated contour: Every half-space depth contour is convex, constraining the an- 
gle between the two vectors associated with the defining half-planes, and Hi 2 . 
If edge Ei 2 is adjacent to Ei x clockwise along the contour of depth k/n with Vh h 
pointing towards the positive rc-axis and if the interior of the contour lies immedi- 
ately above the positive x-axis, then vh 1o points into the lower semicircle and the 
angle of vh 12 is <j> € (ir, 2n), see Figure 01J a). 

Property 2. Insertion Property: This property describes the updates needed 
as points are added to or deleted from the data set: When a data point q is added 
to (deleted from) the data set, the number of data points increase (decrease) by 
one in all half-planes that contain q: those whose vectors lie in counter-clockwise 
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interval from qto to q a way- If Qto has angle 7 then the affected half-planes have 
angles [7, 7 + ir], see Figure EJb). (If 7 > ir, then, for the computations below, it is 
easier to write this interval as [7, tt) U [0, 7 — 7r].) 



Figure 4. (a) The convexity property: The four regions defined 
by the defining lines for p, and the angle to q. The vectors v% 
and V2 are the vectors associated with the defining half-planes, 
and l\ and I2 are the defining lines. In addition, this contour is 
in the standard position by the convexity property since Vi points 
along the positive x-axis and v 2 points below the x-axis. (b) The 
counting property: The number of points in regions described in 
the counting and converse counting properties. For example: If the 
vector associated with a half-plane points into the interval (0,0), 
then the half-plane contains at least fc+1 data points. If the vector 
associated with a half-plane points into the interval {<f>, 2n) and its 
bounding line passes through two data points, then the half-plane 
also contains at least k + 1 data points. 

Property 3. Counting Property: This property describes the minimal number 
of data points in any half-plane with respect to the local depth contour at a data 
point: Let H m be a half-plane where m passes through data point p with half-space 
depth k/n. Then, if H m intersects the interior of the cover-based contour of depth 
k/n, then it must contain > k points and if it does not intersect the interior of the 
cover-based contour, then it must contain > k points, see Figure|4tb). In particular, 
assume that the defining half-planes for p are Hi t and Hi 2 and that they satisfy the 
orientation with angle <f> as described in the convexity property, Property Q] For 
any unit vector vn m with angle /3, consider the number of data points in its the 
associated half-plane. 

• If < j3 < <f), then the half-plane H m intersects the interior of the cover- 
based contour of depth k/n and must contain > k data points; if m passes 
through p and a second data point, then H m must contain > k + 1 data 
points. Otherwise (for both cases) there would exist a point (witnessed 
possibly via a rotation) in the interior of the contour of depth k/n with 
depth less than k/n. 




(a) 



(b) 
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• If = (3 or <j) — f3 then H m is a denning half-plane for p and contains 
exactly k + 1 points. 

• If </> < /3 < 2ir, then H m contains > k data points; if m passes through p 
and another data point, then H m contains > k + 1 data points (otherwise 
p would have depth < k/n). 

Property 4. Converse counting property: This property shows that if the half- 
planes whose bounding line passes through the data point p follow the conclusions 
of the counting property, then the depth of p must be k/n: Assume that there is 
a continuous region on the unit circle, B, containing vectors whose associated half- 
planes have > k + 1 data points; the half-planes defined by p and another data point 
whose vectors are in B contain > k + 2 points. The half-planes whose associated 
vectors are on the boundary of B must have 2 points on their bounding line and 
their half-planes contain exactly k + 1 points. The half-planes whose vectors are in 
the complementary region B c contain > k points; the half-planes defined by p and 
another data point whose vectors are in B c contain > k + 1 points. Then point p 
must have depth k/n. Since every half-plane contains k points, the half-space depth 
of p is at least k/n. Let H m be one of the half-planes forming the boundary of B 
and q the data point not equal to p on m. By a rotation of H m , a second half-plane 
can be found which contains all data points except for q. This new half-plane will 
therefore contain p and k data points. Therefore, the depth of p is k/n, see Figure 
lib). 

Property 5. Location property: Given two candidate defining half-planes, i.e., 
ones which contain the correct number of data points, this property describes the 
possible positions for the defining half-planes: Given two arbitrary half-planes H mi , 
H m2 whose bounding lines each pass through point p (of depth k/n) and one other 
data point and which each contain k+1 data points, then the vectors for the defining 
half-planes are restricted to lie in two small intervals, see FigureJSJa). Assume Vji m 
points towards the positive x-axis and Ujj m2 points into the lower semicircle with 
angle /3. By the converse counting property, Property[4j the angles of the associated 
vectors of the defining half-planes must be in the interval [0,/3]. Alternately, if an 
associated vector for a defining half-plane makes an angle in the interval \J3 — 7T, it], 
then the union of this half-plane, H mi , and H m2 is the entire plane, contradicting 
the assumption that p is on a non-trivial contour. Therefore, the angles of the 
associated vectors for the defining half-planes must come from [0, (3 — ir) and (ir, ft], 
one vector from each interval because, by the counting property, Property [3l H mi 
and H m2 must lie in Region 2 from Figure H^a). 

Property 6. Nestedness property: This property shows that the contour re- 
gions can only expand when a point is added to the data set: When a point q is 
added to the data set, the half-space depth cover-based contour of depth k/n rela- 
tive to T n is nested in the half-space depth cover-based contour of depth k/(n + 1) 
relative to T n U {q}. When q is added, the depth of all half-planes either increases 
or remains the same. Thus, the depth of every point either increases or remains 
the same. Assume p lies on a nontrivial contour with defining half-planes Hi t and 
Hi 2 and the depth of p remains unchanged when q is inserted. In addition, assume 
that the defining half-planes for p satisfy the orientation described in the convexity 
property, Property [1] Then, the angles of the associated vectors for the defining 
half-planes after q is inserted must be in the interval {0} U [4>, 2ir). 
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Figure 5. (a) The location property: The two half-planes H mi 
and H m2 are candidate defining half-planes because they include 
the correct number of data points. The actual defining half-planes 
must lie in the two smaller intervals [0,/3 — tt) and (one in 

each) to ensure that the counting property, Property [3J remains 
valid, (b) Case 1: The counts of all half-planes when point q 
is inserted into region 1. The contour shown is the old contour 
of depth k/n. The depth of p and the defining half-planes for p 
remain unchanged by the counting property, Property [3J 

4.2. Possible Cases. Consider a data set T n before and after an update step, 
where point q is either inserted into or deleted from the data set. Assume point 
p has depth k/n and lies on a nontrivial contour before and after the update. In 
addition, assume that the defining half-planes for p with respect to T n are Hi 1 and 
Hi 2 satisfying the orientation described in the convexity property, Property [TJ Let 
the two defining half-planes for p after the update (in T n U {q} or T n \ {q}) be 
H mi and H m2 . In this case, lines l\ and li divide the plane into 4 regions, see 
FigurelUa). Call region (H h ) c n (Hi 2 ) c region 1. Call H h n {H h ) c region 2 and 
(Hi ± ) n Hi 2 region 3. Call Hi 1 DHi 2 region 4- Let v — qto be the unit vector from 
p towards q and let 7 be v's angle. 

Consider the possible cases created by inserting q relative to the four regions. 
Point q can be inserted only into the interior of one of the four regions (inserting 
into the boundary will violate general position) and deleted from the interior of 
any of the four regions or a defining edge. We first present an analysis of the cases 
created when a point q is inserted into the set. 

Case 1. Point q is inserted into region 1 (see Figure [5](b)): q is inserted into 
neither of the defining half-planes Hi t , Hi 2 , so < 7 < <fi — tt. Since the number 
of data points contained in Hi x and H\ 2 does not change, the depth of p remains 
k/(n+l) by the converse counting property, Property |4j By the nestedness property, 
Property [HI the new defining- half planes must be in the interval {0} U [<fi, 2tt). By 
the location property, Property [SJ the defining half-planes must also be in the 
intervals [0, <fi — tt) and (tt, </>]. The intersection of the two sets is {0, </>}. Since after 
the update p lies on a nontrivial contour, then H mi and H m2 are distinct so the 
defining half-planes for p remain unchanged. 
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Figure 6. Case 2: The counts of all half-planes when a point q 
is inserted into region 3. The contour shown is the old contour of 
depth k/n. The depth of p remains unchanged, but the old defining 
half-plane with angle tf> is no longer a defining half-plane. It is the 
first half-plane containing k + 1 points and whose boundary passes 
through two data points in the indicated region when traversed 
clockwise from the 7 + it angle. 

Case 2. Point q is inserted into region 2 or 3 (see Figure [6]): q is inserted 
into exactly one defining half-plane. Assume q is inserted into region 3, thus Hi 2 
contains an extra point, does not contain an extra point, and <p — it < 7 < tt. 
By the insertion property, Property [2] all half-planes with angles between 7 and 
7 + 7r include an additional point. Hi t will not contain an additional point, so 
the depth of p will remain k/(n + 1), but Hi 2 will include an extra point so it 
can no longer be a defining half-plane for p by the counting property, Property [3] 
By the nestedness property, Property the angle of vector Uff m2 must be in the 
interval U\ — {0} U [0, 2tt). By the insertion property, Property [2] every defining 
plane whose vector has an angle in the interval U2 — [7, 7+7r] includes an additional 
point, therefore, by combining the counting property, PropertyEJ the angle for Uff m2 
must be in the interval U± \ TJ% = [7 + tt, 2tt\. By the counting property, Property 
El v H m2 must be the first vector encountered when traversing the vectors starting 
from vn l2 counter-clockwise, whose bounding line passes through two points and 
associated half-plane contains k + 1 data points. 

Assume vn m2 has angle /3 and consider the location of the second defining half- 
plane H mi : By the location property, PropertyEl the associated vectors to the new 
defining half-planes must be in the intervals V\ = [0, /? — ir) and V2 — (tt, (3), so va m 
must be in the interval [0,/3 — 7r) since vn m2 £ (7r, /3] . However, by the nestedness 
property, Property [H the angles of the associated vectors for each of the defining 
half-plane must also be in Ui, but Ui H Vt = {0}. Therefore, the associated vector 
for H mi must have angle 0, and H mi =11^. 

Case 3. Point q is inserted into region 4 (see Figure [7]): q is inserted into 
both defining half-planes, thus n < 7 < cf>. By the insertion property, Property 
[2 every half-plane whose associated vector has angle in the interval [7,7 + it] 
gains an additional point. Since 7 > n, we safely replace this interval with U = 
[7, 2tt) U [0, 7 — 7r]. Combining these two ideas, we note that, in particular, every 
half-plane in the interval {0} U [(f), 2tt) gains an additional point. By the counting 
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Figure 7. Case 3: The counts all half-planes when a point q 
is inserted into region 4. The contour shown is the old contour 
of depth k/n. The depth of p increases by 1 and at most one 
half-plane can move into the indicated region. The new defining 
half-plane will be the first half-plane whose angle is either counter- 
clockwise from tt or clockwise from (j) — tt which contains k + 1 
points and whose boundary passes through two data points. 



property, Property [3J every half-plane whose vector has angle in the complement 
of U already contains at least k + 1 data points and every half-plane whose vector 
has angle in U, which previously contained at least k data points, now contains at 
least k + 1. Therefore the depth of p increases to (k + l)/(n + 1). 

The two defining half-planes of p relative to J- n , Hx x and Hi 2 contain k + 2 data 
points each. By the location property, Property [5J the angles of the associated 
vectors VH m , VH m must be in the intervals [0, 4> — tt) and (tt, 4>\. Assume that Hi 2 
is no longer a defining half-plane for p, then the angle of Vn m is in the interval 
(71", <j>). By the counting property, Property |3l combined with the insertion property, 
Property[2j every half-plane whose vector has angle in (7, 0) contains at least k + 2 
data points and whose bounding line passes through two data points must contain 
k + 3 data points and cannot be a defining half-plane for p. Thus, the angle for 
VH m must be in the interval (71", 7]. Assume vu m has angle /3. By the counting 
property, Property [3l vu m must be the first vector counter-clockwise from 7r to 7 
where m,2 passes through two data points and H„ l2 contains k + 2 data points. 

Now, assume that is also no longer a defining half-plane for p. Then by the 
location property, Property the angle for v mi , must be in the interval (0,(3 — n). 
By construction, f3 < 7, so (3 — tt < 7 — tt. By the counting property, Property 
[3l combined with the insertion property, Property [2l every half-plane whose vector 
has angle in (0,7 — tt) contains at least k + 2 data points and whose bounding 
line passes through two data points must contain k + 3 data points and cannot 
be a defining half-plane for p. However, since (0,/3 — tt) C (0,7 — tt) there is a 
contradiction since the angle for v mi must be in that interval. Thus v mi = vi ± and 
H mi = Hi 1 . 

The case where Hi t is no longer a defining half-plane is handled similarly. 

To describe the cases dealing with the removal of point q we will use the following 
theorem of independent interest that shows that the defining half-planes cannot 
change by much after an update. 
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Theorem 4.1. Let p be a data point which lies on a nontrivial half-space depth 
contour with respect to the data set J- n and assume that after a point q is inserted 
into (deleted from) J- n , p once again lies on a nontrivial contour. Then at least one 
of the two defining half-planes of p ( with respect to J- n ) remains unchanged with 
respect to the new data set T n U {q} (T n \ {q} ). 

Proof. By cases 1-3, when q is inserted into JF n , one of the defining half-planes 
remains unchanged so the theorem is proved for the cases of insertions. 

The claim is true when q is inserted, but assume, for contradiction, that it is 
false when q is deleted. Consider first deleting and then reinserting q. After these 
two updates, the data set returns to its original state and the defining half-planes 
for p must be unchanged. Moreover, by above, the reinsertion of q can only change 
one of the defining half-planes for p with respect to T n \ {q}. If the deletion of q 
changed both defining half-planes, it is impossible for the reinsertion of q to return 
the defining half-planes to their original state. This is a contradiction and therefore 
the theorem is proved for the cases of deletions. □ 

We now analyze the cases where the point q is deleted. In particular, Theorem 
14. H is non-constructive for deletions, but the constructions themselves are necessary 
for the description of an algorithm. For most of these cases, we determine the new 
defining half-planes by performing an insertion backwards and considering which 
type of insertion could result in the desired situation. 



Figure 8. Case 4: The counts of all half-planes when point q is 
deleted from region 1. The contour shown is the old contour of 
depth k/n. The depth of p remains unchanged and at most one 
half-plane will move into the indicated region. The new defining 
half-plane will be the first half-plane whose angle is either clockwise 
from <fi — 7r or counter-clockwise from 7r which contains k + 1 points 
and whose boundary passes through two data points. 

Case 4. Point q is deleted from region 1 (see Figure [8]): q is deleted from 
neither of the defining half-planes, thus < 7 < <f> — 7r. Consider the counting 
property, Property^ combined with the insertion property, Property [21 and con- 
sider a half-plane Hi whose associated vector vh, has angle (3. If < (3 < 7 or 
7 + 7r < f3 < (f), then q does not affect this half-plane and Hi contains at least k + 1 
data points both before and after q was removed and contains at least k + 2 data 
points if its bounding line passes through two data points. If 7 < (3 < 7 + n, then q 
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is contained in Hi and the number of data points in Hi decreases by one. Before q 
was deleted Hi contained at least k+1 data points and contained at least k + 2 data 
points if its bounding line passed through two data points. After q was deleted, 
there are at least k points in Hi and at least k + 1 data points in a half-plane whose 
bounding line passes through two data points. If <j> < /3 < 2tt, then q does not affect 
this half-plane and Hi contains at least k data points and k+1 data points if its 
bounding line passes through two data points before and after the deletion. Finally, 
if P = or a (H^s associated vector vh 1 is cither vh 1 or vh 2 )i then q does not 
affect this half-plane and Hi contains exactly k + 1 before and after the deletion. 
The number of points in some half-planes has decreased, but the half-space depth 
of p is still at least kj(n — 1) so the depth of p remains unchanged. 

The defining half-planes may change location. By the location property, Property 
the associated vectors to the new defining half-planes must be in the regions 
[0,0 — tt) and (it, <j>]. By the discussion above, we can shrink these regions to 
(-f,4> — tt) U {0} and (tt, 7 + tt) U {<j)}. By the counting property, Property [3l to 
locate these vectors one needs to traverse the region from <f> — tt towards 7 and from 
7r towards 7 + tt until the first vector whose associated half-plane passes through 
two data points and contains k + 1 data points is located in each direction. If none 
exist, then the defining half-planes remain unchanged. By Theorem 14.11 at most 
one defining half-plane will change its location. 



Figure 9. Case 5: The counts of all half-planes when point q 
is deleted from region 3. The contour shown is the old contour 
of depth k/n. The depth of p decreases and the half-plane with 
angle is no longer a defining half-plane. It is the first half-plane 
containing k points and whose boundary passes through two data 
points in the indicated region when traversed clockwise from 7 + tt 
to (j). 

Case 5. Point q is deleted from region 2 or 3 (see Figure [9} q is deleted from 
one of the defining half-planes. Assume w.l.o.g. that q is deleted from region 3. For 
the insertion of data point q to end up in region 3 (or 2) it must be inserted into 
region 4 (Case [3]). Insertion into region 4 increases the depth of p and may move 
one of the defining lines into the inside of the old contour. Therefore a deletion will 
decrease the depth of p. After the deletion, H ± will still contain k + 1 data points 
and have two data points on its boundary; also, Hi 2 will now contain only k data 
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points and have two data points on its boundary. Therefore, Hi x can no longer 
be a defining half-plane for p. In fact, since q must now correspond to a point in 
region 4, Hi t must change to a half-plane containing k points and whose boundary 
includes two points in the interval (4>,j + it]. To locate this vector, traverse the 
region from 7 + ir to <p m the clockwise direction to find the first vector whose 
associated half-plane satisfies these conditions. 




Figure 10. Case 6: The counts of all half-planes when point q 
is deleted from region 4. The contour shown is the old contour of 
depth k/n. The depth of p decreases and the defining half-planes 
for p remain unchanged by the counting property, Property [3] 

Case 6. Point q is deleted from region 4 (see Figure I10|) : q is deleted 
from both defining half-planes tt < 7 < <j). Since both Hi t and Hi 2 now contain k 
points and their bounding lines pass through two data points, the depth of p is now 
(k — l)/(n — 1). By the location property, Property the new defining half-planes 
must lie in the intervals [0, <j) — ir) and (7r, 0], with one in each interval. However, 
by the counting property, Property [3l every half-plane in the interval (0,(f>) whose 
bounding line passes through two data points contains > k+2 data points. After the 
deletion of q, these half-planes contain > k + 1 data points. Therefore, they cannot 
be defining half-planes and Hi ± and Hi 2 must remain the defining half-planes. 

Case 7. Point q is on a defining line (see Figure There are two possible 
cases when q is on a defining line. The two cases occur if q is in the same direction 
fromp as the cover contour edge, see Figure lTTT aK or if q is in the opposite direction 
as the contour edge, see Figure [Hfb). If q lies on the boundary between regions 

1 and 2 or regions 1 and 3, then q lies in the direction of the contour edge. For 
the insertion of data point q to end up in this position on a definition line, it must 
be inserted into region 2 or 3 (Case [2]). If q lies on the boundary between regions 

2 and 4 or regions 3 and 4, then q lies in the opposite direction from the contour 
edge. For the insertion of data point q to end up in this position on a defining line, 
it must be inserted into region 4 (Case [3]). In either case, the half-plane including q 
can no longer be a defining half-plane since its boundary line passes through only 
one data point. 

In first case, assume w.l.o.g. that q lies on the boundary between regions 1 
and 3, see Figure [TTta). Then Hi 2 is no longer a defining half-plane. By undoing 
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Figure 11. Case 7: The counts of all half-planes when point q 
is deleted from a defining half-plane, (a) The case where q is in 
the same direction as the defining edge and is deleted from the 
boundary between regions 1 and 3. The half-space depth of p 
remains unchanged, but the defining half-plane with angle </> is no 
longer a defining half-plane. The new defining half-plane is the 
first half-plane containing k + 1 data points and whose boundary 
passes through two data points in the counter-clockwise direction 
from 7r to <f>. (b) The case where q is in the opposite direction 
from the defining edge and is deleted from the boundary between 
regions 2 and 4. The half-space depth of p decreases, and the 
defining half-plane with angle 4> is no longer a defining half-plane. 
The new defining half-plane is the first half-plane containing k + 1 
data points and whose boundary passes through two data points 
in the counter-clockwise direction from (f> to 2ir. 

an insertion into region 3, Case [21 we can locate the new defining half-plane by 
searching in a counter-clockwise direction from 7r to to find the first vector whose 
associated half-plane contains k + 1 data points and whose boundary contains two 
data points. 

In the second case, the depth of p decreases, and assume w.l.o.g. that q lies on 
the boundary between regions 2 and 4, see Figure [TlTb'). Then H[ 2 is no longer a 
defining half-plane. By undoing an insertion into region 4, Case [31 we can locate 
the new defining half-plane by searching in a counter-clockwise direction from <f> to 
2ir to find the first vector whose associated half-plane contains k data points and 
whose boundary contains two data points. 

4.3. Algorithm. In this section, we augment the data structure presented in Sec- 
tion 3 to maintain not only the half-space depth of a point, but also the defining 
half-planes, provided that the data point is not on a degenerate contour. By in- 
specting the arguments in Section 14.21 we find that all that is needed is a search 
ability to find new defining half-planes, this operation is described here. Note that 
although all of the arguments in Section 14.21 assumed that the current position of 
the half-planes obeyed the convexity property, Property [1] it is straightforward to 
reinterpret and rotate the results for defining half-planes in arbitrary positions (e.g., 
a rotation is just a translation of the unit circle). 
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• Next(T, q, k) , Prev(T, q, fc) - Given a set of vectors T emanating from 
data point p, labeled with Depth, a vector q, and a depth value k, report 
the first vector in a counter-clockwise or clockwise direction, respectively, 
after q of Depth k (assuming such a vector exists). 

4.3.1. Detailed Operations. 

• To perform a Next(T, q, k) , first locate the leaf q by searching the tree, 
and, while searching the tree, update the Depth, subtree-addition, minDepth, 
and maxDepth fields of the children of the path as the tree is traversed. 
Now, traverse this path in the reverse direction. From bottom to top, ev- 
ery time there is a right child of the path that is not on the path, consider 

its minDepth and maxDepth values. If k is contained in this range, descend 
through this tree taking the path to the left child whenever k is included in 
the interval formed from its minDepth and maxDepth values and the right 
child otherwise. If we reach the root of the tree, without finding appropri- 
ate minDepth and maxDepth values, let left be the leftmost leaf in the tree 
and perform Next (T, left , k) (this step wraps around the break between 
2tt and in the unit circle). To perform Prev(T, q, k), perform the same 
operation as described above in Next, but with the roles of right and left 
reversed. 

Theorem 4.2. An algorithm exists that can dynamically maintain the half-space 
depth of a data point and maintain its defining half-planes, if defined, relative to a 
dynamic set of n points in 0(logn) time per operation and linear space. 

Proof. Maintain the structure presented in Section 3, augmented with the opera- 
tions from this section and update the defining half-planes using the operations as 
described in Section 14.21 □ 

Theorem 4.3. If p lies on a non-degenerate contour and the defining half-planes 
for a point p are known, then the new depth of p can be determined when q is either 
inserted into or deleted from the data set in 0(1) time. 

Proof. The cases in Section 14.21 describe all possibilities and can be decided in by 
comparing the point of interest to the defining half-planes. □ 

Thus far, we have avoided the degenerate case. It is possible to check whether 
this case occurs using the Next and Prev operations. We use the convexity property, 
Property [1] and the counting property, Property [3l to note that if p has depth k/n, 
then all meaningful half-planes with respect to p containing k + 1 data points and 
whose boundary contains 2 data points must have associated vectors lying in a 
semi-circle. One easy way to check this is as follows: Divide the unit circle into 
four intervals: [0,7r/2], [ir/2,ir], [ir,3ir/2], and [37r/2,27r]. Search for the extreme 
meaningful half-planes in each interval containing k data points. Then p is on a 
degenerate contour if and only if these vectors do not lie in a semi-circle. 

5. Dynamically Updating Rank Contours 

In this section, we keep a collection of data structures either from Section 3 or 
from Section 4 to maintain the depth of each point and then resort the data points 
according to their new depth. (There is a choice of data structures because the 
structures from Section [3] maintain the data necessary for this algorithm. Since the 
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structures from Section |4] are augmented versions of this data structure, they also 
maintain the necessary data.) We begin with a simple observation: 

Observation 5.1. When a data point is inserted into or deleted from a data set, 
the value of k in each point's depth, kjn, can increase or decrease by at most 1 step; 
i.e., for an insertion the new half-space depth is either k/(n + 1) or (k + l)/(n+ 1) 
and for a deletion, the new half-space depth is either kj (n — 1) or (k — — 1). 

The re-sorting step uses a list of buckets that hold all data points, where bucket 
k holds the list of data points of depth kjn. The list contains buckets from to 
[n/2j, the maximum number of contours for the current data set. When a new 
point is inserted or deleted, every point q that changes its depth must be moved: for 
example, if a data point changes its half-space depth from k' jn to [k! + l)/(n+ 1), 
then it is removed from its old bucket (k 1 ) and inserted into the new bucket (fc' + 1). 
Observation 1 5. II guarantees that the depth of q does not change by more then 1 and 
so the update takes constant time. The structure guarantees that the data points 
sorted by depth can be retrieved in linear time. 

To retrieve data points by x-coordinate, a second copy of the list of data points, 
sorted according to their x-coordinate and augmented with pointers to the depth- 
sorted list, is maintained. Every point in the depth-sorted list is augmented with 
a pointer to its bucket, which allows a constant time retrieval of the depth of any 
point and a total running time that is linear in the set size. 

5.1. Algorithm. Assume point q is inserted into or deleted from the data set. 

• Add or remove, if needed, a bucket to account for the new size of the data 
set. 

• For every point p G S 

— Update the data structure associated with p by incrementing or decre- 
menting the number of points in each of the half-planes containing 

q 

— If q was inserted into the data set, update p's data structure by 
inserting the two vectors associated with the meaningful half-planes 
determined by p and q, setting the corresponding to fields according 
to the direction to q, and labeling their Depth as the depth associated 
with their direction. The Depth can be computed by examining the 
neighboring half-planes' Depth and to fields. 

— If q was deleted from the data set, update p's data structure by 
Removing the vectors associated with the meaningful half-planes de- 
termined by q. 

— Update the depth of p by considering the value of minDepth at the 
root of the tree. 

— If the depth of p changed, remove it from its bucket and place it in the 
adjacent bucket, according to its new Depth. 

• If the data structure which was chosen for this algorithm is the one from 
Section 4 then, since this data structure also maintains the defining half- 
planes, the defining half-planes must be updated: 

— Compare q to the defining edges of p, and decide which of the cases 
described in Section 14.21 applies. If needed, locate the new defining 
edges as described in the cases in Section 14.21 
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— If p is the unique point with maximum depth (maximum when com- 
pared to the other points), then use Next and Prev to determine if 
p is on a degenerate contour as described in Section 14.21 If p is on a 
degenerate half-plane, update its defining half-planes to be null. 

— If p was on a degenerate contour, but is no longer on a degenerate 
contour, then use its data structure along with the converse counting 
property, Property [4j to determine its defining half-planes. 

• If q was inserted into the data set, build its data structure by sorting all 
vectors around q, assigning them the correct Depths, computing the half- 
space depth of q, and placing it in the correct bucket. 

• If q was deleted from the data set, remove it from the correct bucket. 

This data structure culminates in algorithms for maintaining the half-space depth 
of all points and the rank-based contours. 

Theorem 5.1. An algorithm exists that can compute the half-space depth ranking- 
order of a set of points dynamically in O(nlogn) time per operation and quadratic 
space. In addition, any rank-based contour can be reported in 0(n) time. 

Proof. Note that we do not actually maintain the contours themselves, but a struc- 
ture to report a contour efficiently. Theorem 15.11 implies that we can maintain 
the ranking-order of all points. Since we also have kept the data points in order 
sorted by x-coordinate and the two structures linked, in linear time, we examine 
each bucket to select the appropriate number of deep data points and then run 
Graham-scan on these data points. In this case, since the data points are already 
ordered by ^-coordin ate, the Graham-sca n for the upper and lower convex hulls 
takes linear time, see |de Berg et all . l2000lj . □ 



6. Dynamically Updating Cover Contours 

We present an algorithm for computing the cover-based half-space depth con- 
tours of a set of points dynamically in 0(n log 2 n) time and overall quadratic space. 
The algorithm is independent of the half-space depth and the rank-based con- 
tour algorithms. This algorithm is based on the intersection of half-planes as de- 
scribed in Corollary 12. 1 | and the relatio nship between half-plane intersections and 



convex hulls, see |de Berg et al. . 2000l p. 245]. Our method is based on the dy- 



namic computation of a convex hull, as described by Overmars and van Leeuwen 



Overmars and van Leeuwenl . Il98lj . Throughout this section, we assume that no 
two data points have the same x-coordinate. If two points have the same x- 
coordinate, then either the entire data set can be rotated so that no two points 
have the same ^-coordinate or, because the data set is in general position, a small 
(or infinitesimal) perturbation could be applied to one of these points. 



6.1. The Dual Arrangement. The algorithm is set in the dual space, see [de Berg et al 
2000l |: every point p = (p x ,p y ) is mapped to the line p* with equation y 



Px% Py 

and every line I given by the equation y — mx + b is mapped to the point I* with 
coordinates (m, — b), see Figure [T2"l The dual transformation operation is called 
T, and the dual transform has the following property: it preserves ordering, so if 
the point p is above the line I, then the line p* is below the point I* by the same 
vertical distance. This transformation is useful because the half-plane intersection 
problem is dual to the convex hull problem. Corollary 12.11 shows that in order to 
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maintain the cover-based depth contour of depth k/n, we should intersect all half- 
planes containing n — k + 1 data points and whose bounding line passes through 
two points. We begin by dividing this intersection in two, by considering the sub- 
set of these half-planes that open upwards and the subset of these half-planes that 
open downwards. In this discussion, we only consider the set of half-planes that 
open downwards because the set that open upwards can be treated similarly. We 
know that the intersection of these half-planes corresponds to an upper convex hull 
problem in the dual. We describe this hull next. 

In the primal, each of the half-planes of interest is a half-plane that is above 
n — k + 1 points and whose bounding line passes through two data points. For such 
a half-plane, the bounding line is strictly above n — k — 1 points. A line that passes 
through two data points corresponds, in the dual, to the intersection point h of two 
lines dual to data points. Since the bounding line was strictly above n — k — 1 data 
points, h is a point strictly below n — k — 1 lines dual to data points, see Figure [T2"1 
Also, any intersection point of two dual lines that is strictly below n — k — 1 dual 
lines corresponds to a half-plane containing n — k + 1 data points, whose bounding 
line passes through two data points, and which opens downward. Therefore, what is 
needed to compute the intersection of half-planes in the primal is the upper convex 
hull of these intersection points in the dual. 




Figure 12. (a) A set of 7 points in the primal plane and (b) 
the corresponding set of lines in the dual plane under the dual 
transformation T where the point p — (a, b) is taken to the line 
y = ax — b. In the dual plane, the dashed lines trace out the first 
and seventh levels (from top to bottom, respectively) while the 
dashed-dotted lines trace out the second and sixth levels (from top 
to bottom, respectively). The highlighted line in the primal, which 
passes through points D and E, defines two half-planes. This line 
has three points strictly above it and two points strictly below it. 
In addition, this line corresponds to dot at the intersection of T(D) 
and T(E) in the dual. This dot has the two lines corresponding to 
F and G above it while it has the three lines corresponding to A, 
B, and C below it. 
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In order to keep track of all of these intersection points with n — k — 1 lines 
strictly above them, we look to the (n — k + l)th level of the dual arrangement (the 
levels are numbered starting with 1). The (n— k+ l)th level is the closure of the set 
of all segments that have n — k lines above them, see Figure [12j The level consists 
of segments and intersection points. For any point on a segment, there are exactly 
n — k lines above it. For any intersection point, there are either n — /corn — k — 1 
lines strictly above it. Moreover, every intersection point with n — k — 1 lines strictly 
above it appears on level n—k+1. Finally, the upper hull of the intersection points 
on level n — k+1 is almost the upper hull of all intersection points with n — k — 1 lines 
strictly above them. The only caveat is that the endpoints, i.e., the first and last 
points on the hull could have n — k lines above them. This can be checked easily by 
considering the slopes of the two intersecting lines. If the slopes of the segments on 
the level increases, then there are n — k — 1 lines strictly above the intersection point 
and if the slope decreases, then there are n — k lines strictly above the intersection 
point. Therefore, our algorithm will maintain the upper hulls (and similarly, the 
lower hulls) of each level, which will determine the cover-based contours. 

6.2. Basic Data Structure. The algorithm uses a data structure for updating 
the convex hull of a set of points dynamically in Q(log 2 n) per update and in linea r 
overall space from Overmars and van LeeuwenL 1981 ; Dobkin and Souvaine , Il987l ]. 



In particular, we maintain each level of the arrangement of lines in the dual in 
such a data structure. The maintenance of the convex hull provides the contours as 
described above in Section 16.11 In this section, we will call a connected component 
of an arrangement where the ^-coordinates of the vertices are increasing a chain; a 
level of an arrangement is an example of a chain. Our operations, given a data set, 
are as follows: 

• Split (T,p) - Given a chain T from an arrangement and a data point p, 
compute the chains associated with the points to the left and right of p. 
This operation was part of the original data structure of Overmars et al. 

• Join(Ti, T 2 ,p) - Given two chains Ti and T 2 such that for every point 
Pi G Ti and P2 G T 2 , xcoord{p\) < xcoord(p) < xcoord(p2) ■ Compute the 
chain that connects the two chains Ti and T 2 via p. This operation was not 
needed for the dynamic update of a single convex hull and cannot be used 
for joining general hulls, but, in this case, the ordering of vertices of the 
chains allows for this operation. 

6.2.1. Detailed Operations. Our implementation uses a red-black tree augmented 
with the operations of a concatenable queue to enable additional dynamic operations 
to the data structure not required by the dynamic convex hull original algorithm: 

• A Split (T,p) is performed in 0(log 2 n) time as in Overmars et al.: Start 
from the root and descend down the search path towards node p, in each 
node the partial hull is disassembled and reassembled at its 2 children. 
After the hull has been split, the tree itself can be split and each portion 
can be rebalanced by local changes. 

• A Join(Ti, T 2 ,p) is performed in 0(log ra ) time. If the black-node height of 
tree T 2 is less than the black- node height of the tree Ti. Then T 2 is added 
as a subtree of 1\. After the trees have been joined, the partial hulls along 
the path of the join are then rebuilt. 
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6.3. Algorithm. Our algorithm computes the contours by computing the upper 
and lower convex hulls of the intersection points along the levels. Each intersection 
point is defined using the two lines of the arrangement that create the intersection. 
Two dynamic trees maintain the upper and lower convex hulls for every level of the 
arrangement, 2n trees for all levels and total of 0(n 2 ) space, the size of the entire 
arrangement. 

When a new point q is inserted into or deleted from the data set, some of the 
dual levels, the ones intersected by the line T(q), and consequently some of the 
convex hulls of these levels, are affected. We compute the new convex hulls by 
updating the convex hull trees. 

6.3.1. Insertions. Given a data point q (and its dual T(g)), a set of n points J- n , and 
its associated data structures, we wish to recompute the half-space depth contours 
of T n ^{q\- After each insertion, each oWlevel k, intersected by T(q), splits between 
levels k and fc+1. Therefore, the new level k becomes a combination of chains from 
old level k that remain in level k plus chains from old level k — 1 that increase level 
to level k minus chains from old level k that increase level to level k + 1. In every 
stitching point between these chains, a segment of T(q) is added. Not all levels 
are necessarily updated, only those intersected by T(q), i.e., the old levels between 
k' + 1 and n — k' are affected, where the half-space depth of q is k'/n before the 
insertion. Since T(q) intersects the arrangement exactly n times, 0(n) chains move 
between all trees and n segments from T(g) are added. 

To update the contours properly, we need to stitch the chains correctly. The 
stitching is done by splitting the dynamic trees representing the chains and merging 
the subtrees, while maintaining a correct representation of the convex hull on the 
root of each tree. 

6.3.2. Detailed Operations. 

• Compute the intersection point of T(q) with every line I of the arrangement 
and sort them according to x coordinate. Since there are only n other lines 
(and n intersections), this step takes 0(n log n). 

• Start a new chain with the leftmost segment of T(q), at the level correspond- 
ing to the order statistic of the slope T(q) with respect to the arrangement. 

• Increment all levels above this new level by 1. 

• Find all of the segments of T(q), and find the arrangement each will join 
when q is added to the data set. This can be done in 0(n) time since the 
difference in the level between two sequential segments is ±1. The choice of 
± is determined comparing the slopes of the two lines at the intersection. 

• Construct the new set of levels by splitting and joining trees, and inserting 
segments of T(q): Starting with the new level consisting of the leftmost 
segment of T(g), for each segment of T(q), in order, Split the appropriate 
chain at the intersection point (the right endpoint of the segment). Next, 
Join the top two chains and the bottom two chains at the split point. 
Finally, Join the remaining chain to the next segment of T(q). Continue 
this until there are no more segments of T(q) to attach. This step takes 
0(log 2 n) for each Split and Join, and there are a total of 0(n) of these 
steps (there are a constant number of steps for each segment of T(q)). This 
results in a total time complexity for the algorithm of 0(n log 2 n). 
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T(C) 
T(D) 




Figure 13. (a) The six levels of the arrangement before the ar- 
rangement is augmented with T(C). (b) T(C) has been subdivided 
where it intersects the other lines of the arrangement. Each interval 
(or ray) has been labeled according to its level in the new arrange- 
ment. The new level 3 has been started at T(C). (c) When the 
new level 3 intersects the old level 3, the chains switch so the right 
part of the old level 3 is stitched to the new level 3 and the left 
part of the old level 3 becomes new level 4. In addition, the next 
segment along T(C) is stitched to the new level 4. (d) When the 
new level 4 intersects the old level 4, the chains switch in a similar 
manner as to the stiching in (c). The next step will be to switch 
the new level 5 and the old level 5. 



6.3.3. Deletions. Given a set of points its associated data structures, and a data 
point q (and its dual line T(q)) we wish to recompute the structure associated with 
T n \ {q}. When a line T(q) is removed from the arrangement, level k splits between 
levels k and k — 1. Similar to the insertion case, the stitching of the levels must be 
done appropriately, while maintaining a correct representation of the convex hull 
at the root of each tree. 

6.3.4. Detailed Operations. 

• Compute the intersection point of T{q) with every line I of the arrangement 
and sort them according to x coordinate. Since there are only n other lines 
(and n intersections), this step takes 0(n log n). 

• Find all of the segments of T{q) in the arrangement, and find the level each 
will leave when q is removed from the data set. This can be done in 0(n) 
time since the difference in the level between two sequential segments is 
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±1. The choice of ± is determined by comparing the slopes of the lines at 
the intersection.- 0(n). 

• Construct the new set of levels by splitting and merging trees, and removing 
segments of T(q): Starting with the level consisting of the leftmost segment 
of T(q), for each segment of T{q), in order, Split both chains at the in- 
tersection point (the right endpoint of the segment). Next, Join the two 
chains which do not involve segments of T{q). Discard the chains consisting 
only of segments of T(q). Continue this until there are no more segments 
of T{q) to discard. This step takes O(kog 2 n) for each Split and Join, and 
there are a total of 0(n) of these steps (there are a constant number of 
steps for each segment of T(q)). This results in a total time complexity for 
the algorithm of Oin log n). 

• Decrement all levels which started below the leftmost segment of T(q). 

Theorem 6.1. It is possible to compute the cover-based half-space depth contours 
dynamically in 0(n log 2 n) time per update and 0(n 2 ) overall space. Moreover, any 
contour can be reported in O(m) time, where m is complexity of the corresponding 
levels in the arrangement. 

Proof. Note that we do not actually maintain the contours themselves, but a struc- 
ture to report a contour efficiently. In order to compute the cover-based contour of 
depth k/n, take the upper hull of the (n— k + l)th level and the and the lower hull 
of the fcth level, removing the endpoints if they are not appropriate, and returning 
these to the primal, where they represent the intersection of half-planes containing 
n — k + 1 data points and whose bounding line passes through two data points. 
Finally, the intersection of two convex regions needs to be performed, which can 
be done in the complexity of the hulls because the hulls are naturally sorted by 
slope, which corresponds to being sorted by x-coordinate in the primal plane. The 
convexity of the hulls is bounded by the convexity of the levels themselves. □ 



7. Conclusion 

In this paper, we provided data structures and algorithms for dynamically com- 
puting half-space depth of points as well as rank-based and contour-based contours. 
The depth of a single point can be maintained in O(logn) time per update using 
overall linear space. The rank-based contours can be maintained in O(nlogn) 
time per update using overall quadratic space. The cover-based contours can be 
maintained in 0(nlog 2 n) time per update using overall quadratic space. 

In addition, we studied the local cover-based contours near data points and 
described how they change as data points are added or deleted from the data set. 
In this direction, we proved a result of independent interest that at least one local 
cover-based contour edge does at a point does not change as points are inserted 
into or deleted from the data set. 

We end with the open problem of finding additional dynamic algorithms for other 



data depth functions as desired by the probability and statistics community, [Liu 



2003] . One would expect that this problem may be significantly harder than its 



half-space depth counterpart because most other data-depth fu nctions are not as 



well-behaved, in the fin ite sample cases, has half-space depth, see [Zuo and Serfling 
20001 : iBurr etall . l200fij . 
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